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Abstract 

This study employed graph theory and machine learning analysis of multiparametric IVIRI data to improve characterization 
and prediction in autism spectrum disorders (ASD). Data from 127 children with ASD (13.5±6.0 years) and 153 age- and 
gender-matched typically developing children (14.5 ±5.7 years) were selected from the multi-center Functional Connectome 
Project. Regional gray matter volume and cortical thickness increased, whereas white matter volume decreased in ASD 
compared to controls. Small-world network analysis of quantitative IVIRI data demonstrated decreased global efficiency 
based on gray matter cortical thickness but not with functional connectivity IVIRI (fcMRI) or volumetry. An integrative model 
of 22 quantitative imaging features was used for classification and prediction of phenotypic features that included the 
autism diagnostic observation schedule, the revised autism diagnostic interview, and intelligence quotient scores. Among 
the 22 imaging features, four (caudate volume, caudate-cortical functional connectivity and inferior frontal gyrus functional 
connectivity) were found to be highly informative, markedly improving classification and prediction accuracy when 
compared with the single imaging features. This approach could potentially serve as a biomarker in prognosis, diagnosis, 
and monitoring disease progression. 
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Introduction 

Autism spectrum disorders (ASD) are a group of polygenetic 
developmental brain disorders with behavioral and cognitive 
impairment [1]. AfiFected individuals exhibit stereotypical repeti- 
tive movements, restricted interests, lack of impulse control, speech 
deficits, impaired intelligence and social skills compared to 
typically developing (TD) children [2]. The underlying neurolog- 
ical changes and their association with the clinical manifestations 
remain an active area of research. 

Imaging studies have been instrumental in mapping aberrant 
connections in the brains of ASD children. Several studies 
revealed both structural and functional connectivity deficits in 
ASD [3,4]. Structural connectivity derived from difiTusion tensor 
imaging in ASD children demonstrate increased diffusivity and/ or 
reduced fractional anisotropy in the long occipitofrontal fasciculus 
and inter-hemispheric corpus caUosal (e.g., minor and major 
forceps) commissure [5] , asymmetric and under-connected arcuate 
fasciculus language pathways [6,7], as well as reduced cerebellar- 
cortical interconnectivity [8]. Functional connectivity MRI 
(fcMRI) shows abnormal dis-inhibition of some subcortical circuits 
[9] , over- or under-connectivity in the superior temporal gyrus and 
amygdala [10]. The caudate nucleus, which plays an important 
role in behavior impulsivity control, novelty seeking trait, 
procedural skill learning, and memory function [11], has been 



reported to increase in volume and its functional connectivity to 
the cortices is enhanced in autistic adolescents and young adults 
[12,13]. These abnormal caudate volume and caudate-cortical 
connectivity have been associated with repetitive autistic behaviors 
[12,13]. 

A key clinical manifestation of ASD is impaired learning and 
social interactions [14]. Mirror mechanisms in frontoparietal and 
sensorimotor networks are involved in the imitation of others' 
actions in normal subjects [15], and have been suggested to be 
dysfunctional in children with ASD [16,17]. In addition, neural 
networks underlying reflective mentalization [18] as well as 
emotional and interoceptive awareness [19] may be impaired in 
ASD. The default mode network (DMN) that consists of the 
medial frontal, posterior cingulate gyri and the temporo-parietal 
connection (which is involved in self awareness, social integration, 
learning and memory functions [20] and is usually deactivated 
under cognitive-demanded task conditions) has been found to be 
deficient in ASD children as well [21]. Furthermore such DMN 
dysfunction has been associated with decreased volumes of the 
structures involved [21]. 

The inferior frontal gyrus (IFG), which is involved in high-level 
memory, language production and comprehension, and learning 
interactions [22-24], has been reported to be abnormal in ASD 
children. Within the IFG, the pars opercularis plays a role in social 
interactions including imitation control [25], while the pars 
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Figure 1. Regions witli significant volume percentage incre- 
ment (A, red color) in gray matter, and volume decrement in 
gray matter (A, blue color) as well as white matter volume 
reduction (B, blue color) in autism compared to controls 
(P<0.01). Significant cortical thickness increment in percentage 
change in autism compared to controls is shown in C with red color 
(P<0.01). Both medial and lateral views on sagittal surface in the left 
and right hemispheres of an individual control subject are shown for 
gray matter volume (A), white matter volume (B) and cortical thickness 
(C) comparison between autistic children and controls. The percentage 
change of volume in autism was calculated as the mean difference 
between volumes of two groups normalized with individual supraten- 
torial volume and scaled by the mean normalized volume of control 
group. 

doi:10.1371/journal.pone.0090405.g001 

triangularis is associated witti language comprehension [26] . Task- 
related fMRI results have demonstrated IFG abnormalities in 
ASD patients, including impaired social function and affective 
emotion [27,28]. FcMRI shows decreased functional connectivity 
in IFG inhibitory circuits, consistent with disruption of mirror 
neurons and their projections involved in social communication 
and language development [29,30]. 

Given the large amount of anatomical and functional connec- 
tivity imaging data, a unified model integrating multiple imaging 
measures may be helpful to better characterize ASD and predict 
the phenotypic outcome [31]. SmaU-world network analysis based 
on graph theory offers some advantages in analyzing complex and 
multidimensional data by providing global and local inter-regional 
modulation information of structural and functional brain 
networks [32]. A potential challenge is the possibility of retrieval 
redundancy ("curse of dimensionality"). Principal component 
analysis is a feature selection method that has been widely used to 
reduce dimensionality by identifying essential components in the 
data structure, while retaining most of the data variation (i.e., 
information) [33]. Maximal relevance and minimal redundancy 
(mRMR) [34] is another feature selection algorithm to reduce 
dimensionality which uses filters based on mutual information and 
correlation to account for maximal dependency and minimal 
redundancy. 

The goals of this study were: i) to evaluate multi-parametric 
fiinctional and structural MRI of the brain in ASD versus TD 
children using small-world network analysis based on graph theory 
to derive local and global efficiency, ii) to use machine-learning 
algorithms to evaluate the ability of these multiparametric MRI 



matrices to classify ASD versus TD groups, and iii) to employ 
machine-learning algorithms of these multiparametric MRI 
matrices to predict ASD chnical phenotypic outcomes, such as 
the revised autism diagnostic interview (ADI-R), autism diagnostic 
observation schedule (ADOS), and intelligence quotient (IQ) 
scores reflecting different aspects of social and learning abilities of 
subjects [35]. 

Methods 

Participants and phenotypic information and MRI 
Imaging 

Data was obtained from the multi-center Functional Coiinec- 
tome Project, which released MRI data of over 500 ASD patients 
(http://fcon_1000.projects.nitrc.org/indi/abide/). We download- 
ed the Autism Brain Imaging Data Exchange database from the 
COINS website (http://coins.mrn.org/) that hosted copies of data 
samples. In accordance with HIPAA guidelines and 1000 
Functional Coniiectomes Project/INDI protocols, all datasets 
were de-identified. Consistent with the policies of the 1000 
Functional Connectomes Project, data usage was approved for 
research purposes. 

127 children with ASD (mean age: 13.5±6.0 years, 24.1% of 
female), 153 age- and gender-matched control TD children (mean 
age: 14.5±5.7 years, 24.8% of female) from the NYU/Yale/ 
Stanford centers were selected for analyses based on age ranges. 
The fuU IQ, scores were slighdy lower in the ASD (104.3±18.9) 
compared to the TD group (1 1 1.7 ± 14.4), with P = 0.001. Imaging 
data consisted of high spatial resolution 3D T 1 -based MPRAGE 
sequence (image size = 160x256x256, resolution = 1x1x1 mm'') 
and resting-state (RS)-fMRI obtained using a gradient echo EPI 
sequence (TR = 2000 msec, resolution =3x3x4 mm'^, 180 vol- 
umes) with 33 slices covering the whole cerebrum. 

The corresponding clinical data from five categories were also 
obtained, consisting of the ADOS scores, the ADI-R, and IQ tests 
(full scale and sub-functionality), social responsiveness scale (SRS) 
and Vinland adaptive behavior scale (VABS) with five domains. 
Detailed information regarding the phenotypic data was outlined 
in previously published articles [10,14]. Briefly, the ADI-R is 
composed of 93 items focusing on the triadic functional domains, 
and administered via interview with categorical results provided. 
The full IQ tests evaluated both verbal IQ (VIQ) and performance 
IQ (PIQ. The ADOS is a semi-structured assessment of social 
affection and communication behaviors, using four modules to 
account for individual expressive language level and chronological 
age. The SRS rating scale measures the social ability of ASD 
children (including total and subdomains of cognition, communi- 
cation, awareness and motivation). The VABS instrument consists 
of social and personal skills for everyday living, including 
subdomains such as interpersonal empathy and socialization. 

Image Processing and Data Analysis 

Preprocessing. Standard preprocessing for RS-fMRI data analysis 
was performed using both FMRIB Software Library (FSL) and 
Analysis of Functional Neurolmages (AFNI) software packages 
with adapted scripts (http://www.nitrc.org/projects/fcon_1000). 
The steps included standard realignment, spatial Gaussian 
smoothing with full width at half maximum (FWHM) of 6 mm, 
band pass temporal filtering of 0.005-0. 1 Hz, co-registration with 
the MPRAGE images, removal of nuisance signals (motion 
parameters, the global signal, and signals derived from cerebro- 
spinal fluid and white matter), and non-linear transformation to 
Montreal Neurological Institute (MNI) standard space. For 
instance, in order to control for motion artifacts, regression and 
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Table 1. Significant regional volume differences (P<0.01) comparing children with autism spectrum disorder (ASD) to age- 
matched typically developing (TD) children after supratentorial volume normalization. 





Region and Location 


VI- TD Children 


V2- ASD Cliildren 


P value (••) 


Percentage change %(A1) 


Frontal GM 


Left inferior 


0.852 


0.938 


0.0068 


10.094% 


Left superior 


18.97 


20.05 


0.0010* 


5.693% 


Right superior 


18.18 


19.12 


0.0034* 


5.171% 


Right medial orbital olfactory 


1.204 


1.104 


0.0034* 


-8.306% 


Frontal WM-left lateral orbitofrontal 


5.761 


5.562 


0.0090* 


-3.454% 


Occipital GM 


Left middle 


4.941 


5.292 


0.0056* 


7.104% 


Left superior 


2.623 


2.902 


0.0002* 


10.637% 


Right superior 


3.279 


3.582 


0.0017* 


9.241% 


Parietal GM 


Left postcentral 


3.961 


4.230 


0.0064 


6.791% 


Right superior 


5.482 


5.888 


0.0054* 


7.406% 


Right postcentral 


3.996 


4.319 


0.0020* 


8.083% 


Right precuneus 


6.495 


7.031 


0.0006* 


8.253% 


Parietal WM 


Left postcentral 


5.771 


5.483 


0.0074 


-4.990% 


Right supramarginal 


7.791 


7.410 


0.0054* 


-4.890% 


Temporal WM 


Left fusiform 


5.811 


5.567 


0.0085 


-4.199% 


Right inferior 


5.157 


4.890 


0.0023* 


-5.177% 


Right superior 


5.779 


5.543 


0.0058* 


-4.084% 



Note-Data (VI, V2) are mean brain volumes after normalization to the supratentorial volume with a scale factor of 1000, no unit. 

**Calculated with two-sample t test to obtain original p-value (shown with P<0.01} between two groups, and with Bonferroni multiple region correction (x8 factor 
given 4 brain lobes in two hemispheres). 

*lndicates a significant difference with corrected P<0.05 after Bonferroni adjustment. 

Al Indicates percentage change between volume of ASD children (V2} and volume of TD children (VI), calculated as Al = (V2-V1)/V1*100%. 
doi:1 0.1 371 /journal.pone.0090405.t001 



residual analyses of six motion parameters including three angular 
rotations (roll, pitch, and yaw in units of degrees) and three 
directional displacements (x,y,z in units of mm) were implemented 
in the preprocessing step. 

Volumetry analysis 

Anatomical data were processed to derive the total supraten- 
torial volume, regional subcortical volumes (45 regions), white 
matter (WM) volumes (70 regions), and cortical gray matter (GM) 
volumes (148 regions) using Freesurfer software (version 5.1.0) 
[36]. Regional volume comparisons were implemented after 
supratentorial volume normalization. The cortical thickness of 
148 cortical regions was measured automatically based on minimal 
communication distance between the gray and white matter 
ribbon of the projected cortical surface [37]. Thickness measured 
with Freesurfer has been shown to be especially useful in 
characterizing the cortical folding and cytoarchitecture shaping 
in different age ranges [38]. 

fcMRI analysis 

The seed-based fcMRI was derived with seeds in the bilateral 
pars triangularis and opercularis regions of the IFG, the caudate, 
and the DMN core seed in the MNI 2 mm template space. These 
seed regions or networks were selected based on previously 
reported abnormalities in ASD [12,13,27,28] . The total number of 



voxels (N) and average correlational z-value (Z) were computed 
from the functional connectivity map of each seed using a 
threshold of cluster corrected P<0.05. The statistical multiple- 
comparison correction was done based on Gaussian random field 
(GRF) theory and implemented with the FSL easythresh 
command using empirical minimum Z>2.3 threshold with cluster 
significance of P<0.05. In addition, to examine the functional 
interhemispheric coordination, voxel-mirrored homotopic corre- 
lations (VMHC) were derived. The global VMHC voxel number 
(minimum Z>2.3; cluster significance P<0.05, corrected) and 
average Z-value were then quantified [39]. 

fALFF analysis 

Fractional amplitude of low frequency fluctuation (fALFF) of the 
fcMRI data at 0.01-0.08 Hz, which reflects spontaneous neuronal 
or functional activity, was analyzed with adapted scripts in FSL 
and AFNI [39]. 

Graph theory via small-world network analysis 

Inter-regional correlations of individual fMRI time courses of 
1 1 2 bilateral cortical and subcortical ROIs based on functional 
parcellation in FSL template space [40] were derived. Graph 
theory-based small-world network analysis for absolute and 
relative, local and global elTiciency quantification at different 
connectivity sparsity levels, was then performed for each subject 
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Table 2. Cortical thickness increment (in a degree of 2.5-5%) comparing autism spectrum disorder (ASD) children to age-matched 
typically developing (TD) children {P<0.01). 





Region and Location 


CI- TD Children C2- ASD Children 


P value (**) 


Percentage change%(A2) 


Frontal 


Left anterior cingulum 


2.908 


3.031 


o.ooos* 


4.230% 


Left mid-anterior cingulum 


2.875 


2.985 


0.0004* 


3.826% 


Left transverse pole 


2.982 


3.119 


0.0083 


4.594% 


Occipital 


Left middle 


2.795 


2.939 


0.0002* 


5.152% 


Left superior 


2.341 


2.453 


0.0056* 


4.784% 


Left lingual 


2.215 


2.308 


0.0089 


4.199% 


Left middle lunatus 


2.118 


2.220 


0.0026* 


4.816% 


Right lingual 


2.287 


2.383 


0.0031* 


4.198% 


Right calcarlne 


2.129 


2.231 


0.0023* 


4.791% 


Parietal 


Left Inferior angular 


3.036 


3.166 


0.0016* 


4.282% 


Left inferior supramar 


2.987 


3.107 


0.0004* 


4.017% 


Left rectus 


2.807 


2.903 


0.0089 


3.420% 


Left post-lateral fissure 


2.616 


2.718 


0.0018* 


3.900% 


Left occipital junction 


2.428 


2.531 


0.0060* 


4.242% 


Left postcentral 


2.338 


2.444 


0.0036* 


4.534% 


Right postcentral 


2.311 


2.423 


0.0029* 


4.846% 


Temporal 


Left superior 


2.691 


2.792 


0.0009* 


3.753% 


Right superior 


2.731 


2.806 


0.0089 


2.746% 



Note-Data (CI, C2) are mean cortical thickness in mm. 

**Calculated with two-sample t test to obtain original p-value (shown with PO.Ol} between two groups, and with Bonferroni multiple region correction (x8 factor 
given 4 brain lobes in two hemispheres). 

*lndicates a significant difference with corrected P<0.05 after Bonferroni adjustment. 

A2 Indicates percentage change between cortical thickness of ASD children (C2} and cortical thickness of TD children (CI), calculated as A2 = (C2-C1)/C1*100%. 
doi:1 0.1 371 /journal.pone.0090405.t002 



[32]. Group differences of mean local and global efficiency 
parameters derived from in-house software were compared with a 
2-sample t-test. The inter-regional correlations of group-wise 
cortical thickness based on 148 bilateral cortical regions and 
volumetry measured with Freesurfer (148 cortical regions, as well 
as 115 subcortical and white matter regions) were also derived, 
and small world network analysis based on structural measures 
(including relative and absolute local as well as global efficiency 
parameters) was then performed and compared between the two 
groups. 

Integration nnodel for two-group classification 

To reduce possible classifier overfitting and improve general- 
ization, feature selection was performed in two steps. First, 
principal component analysis was used to decompose the 
covariance matrix of the imaging features using the singular value 
decomposition program in Madab (release 2010b; MathWorks, 
Natick, Mass) [33] after variance normalization. Then the number 
of sorted components based on singular values that contained 99% 
or 95% of the information from the covariance matrix of all 
features was determined. Finally, an advanced feature selection 
algorithm, based on mutual-information and integration of both 
mRMR criteria [34] , was used to select imaging features based on 
the number of features (components) determined via principal 
component analysis. 



A total of 22 quantitative local and global imaging features were 
generated from the structural and functional datasets based on 
preliminary data showing statistical differences between the two 
groups. These features include: i) volumes of 9 local subcortical 
regions (the bilateral caudate, thalami, pallidum, hippocampi and 
cerebellum), ii) 8 features from the fcMRI voxel number N and 
average Z values seeding from the bilateral caudate, bilateral IFG 
pars opercularis and triangularis, and DMN core seeds respec- 
tively, iii) 3 fALFF values (averaged values from the bilateral 
caudate, IFG pars opercularis, and IFG pars triangularis), and iv) 
global VMHC voxel number (corrected P<0.05) and average Z 
value (subtotal of 2 features). 

To differentiate the two groups based on the selected imaging 
features, a total of 67 available classifiers, including support vector 
machine (SVM), Bayes network (BayesNet), radial basis function 
(RBF), and sequential minimal optimization (SMO) algorithms, 
were tested with batch-mode scripts developed in WEKA software 
(http://www.cs.waikato.ac.nz/ml/weka, version 3.6.7). 

To evaluate each classifier, a cross validation method with 
different folds (numbering from 2 to 10) or percentage splits (from 
10% to 90%) was applied to the training data set as well as to the 
whole dataset in WEKA software. After the dataset was randomly 
reordered and splitted into n folds of equal size, one fold was used 
for testing and the other n-1 folds were used for training the 
classifier for each iteration. The performances of these classifiers 
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Figure 2. The functional connectivity network seeding from 
caudate and IFG region are demonstrated. In controls, the 
caudate-cortical network showed positive connections with the frontal 
and subcortical regions, and negative connections with the posterior 
visual and parietal areas (A1). The caudate-cortical network in children 
with ASD showed a primary reduction of cingulum and middle 
temporal connectivity (red color), but increased connectivity in several 
regions including the inferior and dorso-latero frontal areas (blue color, 
A2-A3). The functional connectivity network seeding from the IFG pars 
trianglaris in controls (B1) showed positive connections with the frontal 
and temporal regions and negative connections with the posterior 
visual and superior parietal areas. Difference of connectivity pattern 
comparing autism to controls seeded from pars trianglaris are shown in 
B2 and B3, with some reductions of visual and temporal connectivity in 
autism (red) and increments in the regions including medial temporal 
areas (blue). Functional connectivity seeding from IFG pars opercularis 
is shown in C1-C3, with primarily decreased connectivity in autism in 
the superior frontal regions. Seed regions are shown in the upper left 
images, and all statistical results were obtained with the threshold of 
minimal Z>2.3; cluster significance, P<0.05, corrected. 
doi:1 0.1 371/journal.pone.0090405.g002 



were further assessed in WEKA by using margin curves and 
threshold curves to show the cumulative probability of difTerence 
between true and false positives as a function of population size. 

Correlational analysis between imaging and phenotypic 
data 

Qviantitative analysis of single imaging features of regional 
volume, N and Z from fcMRI, as well as the combined selected 
imaging features were correlated with the clinical data (i.e., 
ADOS, ADI-R, SRS, VABS, and IQ, tests including both full and 
sub-domain scores). Bonferroni adjustment was implemented by 
multiplying the original p-values with both the number of category 
tests (x,5 in this study) and the number of sub-domain tests (e.g., 
xlO for ADOS, x3 for IQ, x5 for ADI-R, x7 for SRS and xl,5 for 
VABS) of each category. 

Integration model of imaging data for phenotypic 
prediction 

To improve previous phenotypic correlation with multipara- 
metric imaging data, regression analysis in WEKA (i.e., prediction 
of clinical data from imaging metrics) between the imaging 
features and clinical data was implemented. A total of 23 available 
advanced regression models were evaluated. These regressors were 
divided into seven families: Kstar, linear and non-linear ZeroR 
rule-based regressors, decision stump trees, and Gaussian process 
classifiers for full data sets. For example, Kstar was taken as an 
instance or memory based classifier using entropy and similarity 
functions with the advantage of updating models based on 
relevance to the previously estabhshed database [41]. 

Results 

Volumetry analysis 

Volume and cortical thickness differences between the ASD and 
TD groups are shown in Figure 1 and Table 1 . A key finding is 
that the ASD group demonstrated significantly larger GM volume 
(with exception of the right medial orbital olfactory region) but less 
WM volume compared to the TD group. Consistent with GM 
volumetric findings, the ASD group also showed significandy 
increased cortical thickness (Table 2). 




Figure 3. A: Small-world network analysis based on fIVIRI data showed similar efficiency (including both local and global) between children with ASD 
and TD controls. Small-worldness analysis based on regional volume (red and blue colors) showed no significant differences of total efficiency (B) and 
global efficiency (C) between children with ASD and controls neither. However, small-worldness analysis based on structural cortical thickness 
measure showed reduced total efficiency (B), specifically decreased global efficiency (i.e. shortest path length) (C) in ASD group (magenta color) 
compared to TD group (cyan color). Error bar denotes standard deviation at each sparsity level after scaling to a random network with the same 
number of degree. 

doi:1 0.1 371/journal.pone.0090405.g003 
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Figure 4. A: Co-variance matrix of 22 quantitative imaging features showed highly-correlated structural metrics (i.e. volumetry) of 9 regions, and 
strong association between average Z-value and voxel number N-value of fcMRI for each region (or global metric). B: principal component analysis 
decomposition of the covariance matrix showing most (>95%) data variation (information) was contained with 4 primary components (pink color) 
and 99% data information was contained in 6 primary components (red color). C: The four imaging features selected via mRMR criteria were right 
caudate volume (1), the fcMRI average Z values seeded from bilateral caudate (2), bilateral IFG pars opercularis (3) and IFG pars triangularis (4). 
Together with the four imaging features selected, the DMN average Z (5) and the VMHC average Z (6) were the other two imaging features selected 
for 99% criteria. 

doi:1 0.1 371 /journal.pone.0090405.g004 



fcMRI analysis 

With seeding in the caudate, the ASD children demonstrated 
reduced functional connectivity in the cingulum and middle 
temporal cortex compared to TD children, but increased 
connectivity in several regions including the inferior and dorso- 
lateral frontal areas (Figure 2A). With seeding in the IFG pars 
triangularis, the ASD group showed reduced fcMRI of visual and 
temporal regions, but increased in the medial temporal areas 
(Figure 2B). With seeding from the IFG pars opercularis, the 
ASD group predominantly exhibited reduced functional connec- 
tivity with the superior frontal regions (corrected P<0.05; 
Figure 2C). 




Ql Instance number 280 0]1 Instance number 280 



Figure 5. Margin curves showing the cumulative probability of 
difference between true positive (sensitivity) and false positive 
(1 -specificity) as a function of instance number (I.e. subject 
number) for 4-feature classification (A) and 6-feature classifi- 
cation (B) based on full dataset. As expected, the 6-feature classifier 
performed slightly better than 4-feature classifier with faster conver- 
gence and more margins maintained. 
doi:1 0.1 371 /journal.pone.0090405.g005 



Graph theory via small-world network analysis 

Small-woridness analysis showed no statistically significant 
difference in the mean total (local and global) efficiency between 
ASD and TD children based on fcMRI (Figure 3A) and 
volumetric data (Figure 3B,C). However, there was reduced 
total efficiency in ASD children (Figure 3B) based on cortical 
thickness, especially in global efficiency, as indicated by increased 
shortest path lengths (Figure 3C). 

Classification of ASD and TD groups 

Co-variance matrix of the 22 imaging features (Figure 4A) 
showed correlated intra-module volumetry of 9 subcortical 
regions, as well as a strong association between the average Z- 
value and the voxel number of fcMRI, both regionally and 
globally. However, the inter-module correlation was not significant 
(r<0.3), suggesting volumetry and fcMRI provided complemen- 
tary information. Singular value decomposition of the co-variance 
matrix (Figure 4B) found that 4 primary components accounted 
for >95% data variation, and 6 primary components accounted 
for >99% data variation. The four primary features were the right 
caudate volume, fcMRI Z values seeded from the bilateral 
caudate, IFG pars opercularis, and IFG pars triangularis; the 
additional two features were the mean Z score maps of the DMN 
and VMHC (Figure 4C). 

For the 6 imaging features selected by the mRMR algorithm, 
the random tree classifier had the highest classification accuracy 
(100%) for correctly identifying ASD patients with the full dataset, 
and 70% accuracy for differentiating ASD patients from TD 
children using 80% percentage split cross validation. Based on the 
4 imaging features, the random tree classifier also had the highest 
accuracy (98%) for the full dataset for two-group classification, 
with 68% accuracy for 10-fold cross validation. The performance 
of the 6-feature classifier was slightly better than that of the 4- 
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Table 3. Significant correlations between each single MRI quantitative metric and phenotypic tests in children with autism 
spectrum disorder {P<0.05), adjusted with number of available patients for each test. 



MRI metric 


Phenotypic Test 


r 


P 


Caudate fcMRI (N) 


ADOS social affection 


-0.44 


0.005 




ADOS research 


-0.43 


0.005 




VABS socialization 


0.35 


<0.0001» 




VABS interpersonal skill 


-0.30 


<0.0001* 




IQ (full) 


-0.47 


0.034 


Caudate fcMRI (Z) 


IQ (full) 


-0.47 


0.034 




VABS socialization 


0.35 


<0.0001» 




VABS interpersonal skill 


-0.44 


<0.0001* 




VABS daily living standard 


0.26 


0.0005* 


IFG triangularis fcMRI (N) 


ADOS module 


-0.57 


0.0016 




ADOS total 


-0.46 


0.014 




ADOS communication 


-0.47 


0.04 




ADOS research 


-0.47 


0.04 




VABS interpersonal skill 


-0.43 


<0.0001* 


IFG opercularis fcMRI (N) 


ADOS module 


-0.64 


0.0002 


IFG opercularis fcMRI (Z) 


ADOS module 


-0.50 


0.007 




VABS socialization 


0.34 


0.0001* 




VABS interpersonal skill 


-0.25 


0.0006* 



Note: N is the total number of voxels and Z is the average correlational z-value computed from the functional connectivity map (fcMRI) of each seed using a threshold of 
cluster corrected P<0.05. 

*lndicates a significant difference with corrected P<0.05 after Bonferroni adjustment, with multiplication factors determined by both the number of category tests 
applied (x5 in this study) and the number of sub-domain tests (e.g., xlO for ADOS, x3 for IQ, and xl5 for VABS) of each category. 
doi:l 0.1 371 /journal.pone.0090405.t003 



feature classifier, with faster convergence and higher margins (i.e. 
better accuracy) maintained (Figure 5). 

Correlation Analysis 

Table 3 summarizes the correlation analysis of the quantitative 
imaging features with phenotypic scores in ASD children, with 
both uncorrected and multiple-comparison corrected p-values. 
Significant correlations were found: i) between the number of 
connections N based on fcMRI from the caudate and the full scale 
IQ,in ASD chUdren (r=-0.47, P = 0.034), ii) between caudate 
fcMRI N and ADOS scales (r<-0.43, P<0.005), iii) between 
caudate fcMRI N and VABS social skill scores (r = 0.35, P<0.001), 
iv) between caudate fcMRI N and VABS interpersonal skills (r = — 
0.3, P<0.001), v) between the caudate connectivity strength, Z 
score and the full scale IQ, (r= —0.47, P = 0.034), vi) between 
caudate fcMRI Z and VABS scores (PO.OOl), vii) between the 
connections (both N and Z) from the two IFG seeds and ADOS 
tests (r£— 0.46, PsO.04), and viii) between the functional 
connections from the two IFG seeds and VABS scores (P<0.001). 

Integration nnodel for prediction 

As for prediction, multiple combined imaging features in each 
domain showed improved correlations with clinical data compared 
to single imaging features for ASD children. For instance, fcMRI 
seeding from the pars opercularis of IFG showed a correlation 
coefficient of —0.64 between ADOS score and single imaging 
features (Figure 6 A1-A5). The correlation coefTicient was 
significantiy improved with four selected imaging features to 
predict clinical phenotypic data. The fitted score using the Kstar 
classifier showed tight coupling (linear slope =1, r=0.95, P< 
0.0001) to the original ADOS module score, with only a few 



outliers (Figure 6B). Similarly, among all regressors, Kstar also 
showed the best predictive power for highly correlating with other 
phenotypic data including full IQ test (r = 0.94, P<0.0001) and 
ADI-R total score (r = 0.97, P<0.0001) when using the four as well 
as six essential imaging features. 

Discussion 

We applied advanced automatic classification and machine 
learning algorithms to multiparametric functional connectivity and 
structural data (i.e., volumetry and cortical thickness) to improve 
characterization and prediction in autism spectrum disorders. The 
major findings of this study were: 1) small-world network analysis 
based on volumetry and fcMRI did not find differences between 
ASD and TD children, while small-world network analysis based 
on cortical thickness revealed reduced total and global efficiency in 
ASD, and 2) through advanced machine learning algorithms, four 
essential imaging features (caudate volume, caudate-cortical 
fcMRI and IFG pars opercularis as well as triangularis fcMRI) 
were able to accurately differentiate the two groups and were 
highly predictive of phenotypic features (e.g. IQ ADOS and ADI 
scores) in ASD. 

Brain volume and thickness differences 

Our findings of GM volume and cortical diickness increases and 
WM volume decreases in ASD compared to TD group are in 
general agreement with prior studies [42-44]. The increased 
cortical thickness in multiple regions throughout the cerebrum, 
together with decreased WM volumes in frontal and temporal 
regions, may represent a reduction in long-distance pathways with 
accompanying local GM volume increments. These changes could 
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Figure 6. Kstar, an instance (similarity) based classifier for regression analysis had the greatest prediction power for ADOS module 
score for ASD children among all the regressors combining 4 selected imaging features. A1-A4: standardized histogram distribution 
showing x-axis as the z-scores of each feature and y-axis being the frequency. A1 -right caudate volume, A2-average Z of caudate fcMRI, A3-average Z 
of IFG opercularis fcMRI, A4-average Z of IFG triangularis fcMRI. A5: phenotypic ADOS module score histogram distribution with x-axis as the original 
score and y-axis being the frequency. B: The correlation coefficient between integrated imaging features and ADOS module score improved largely 
from —0.64 based on single best imaging feature to 0.95 combining 4 imaging features. The fitted score using Kstar classifier with 4 selected imaging 
features showed a tight coupling (linear slope = 1; r = 0.95, P<0.0001) with the original ADOS module score with only a few outliers. 
doi:1 0.1 371/journal.pone.0090405.g006 



lead to a reduction in global efficiency, consistent with EEG/MEG 
results [45,46] supporting the hypothesis of unbalanced networks 
in autism [47]. 

Functional connectivity differences 

Changes in caudate volume, caudate-cortical fcMRI, and IFG 
fcMRI were found to be highly predictive based on advanced 
machine learning algorithms. Most of the published literature on 
classification in autism is based on single imaging measures (such 
as volume, connectivity, or perfusion) [48-50]. Our integrated 
multiparametric model demonstrates markedly improved accuracy 
(i.e., sensitivity and specificity) for classification and prediction. 

Consistent with task-based fMRI results [29,30], we found 
altered IFG fimctional connectivity, suggesting deficits in the 
mirror mechanism in autistic children. Disrupted DMN fcMRI 
and altered interhemispheric functional connectivity are in 
agreement with previously reported studies in ASD [51,52]. In 
addition, we found that caudate-cortical connectivity differed 
between ASD and TD children, and this metric correlated with 
full-scale IQ scores. A recent study reported aberrant functional 
connectivity from the caudate seed involving early developing 
brain areas and implicated developmental derangement of related 



functional brain circuits in ASD begin at a young age (from 7-14 
years old) [14]. 

Small-world network analysis of fcMRI could not distinguish 
between ASD and TD groups, suggesting there might be 
functional compensatory mechanism in autism, including in- 
creased regional functional connectivity and reduced global, long- 
distance connectivity in ASD. These plasticity changes have been 
reported in other functional and structural connectome studies. 
For example, a recent fcMRI study found that there are changes in 
local efficiency in the posterior cingulum associated with autistic 
trait accompanied with changes in local tissue integrity in ASD as 
measured by diffusion tensor imaging [53]. The notion of re- 
routing of brain structural connectivity has also been reported in 
less severely cognitively impaired (IQ>70) ASD patients [54]. In 
particular, they found that these patients utilized more visuospatial 
processing networks as opposed to linguistically mediated path- 
ways. 

Machine learning and prediction 

Changes in caudate volume, caudate-cortical fcMRI, and IFG 
fcMRI were found to be highly predictive based on advanced 
machine learning algorithms. Most of the published literature on 
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classification in autism has been based on single imaging measures 
(such as volume, connectivity, or perfusion) [48-50]. Our 
integrated multiparametric model demonstrates markedly im- 
proved accuracy (close to 100% sensitivity and specificity) for 
classification of ASD from TD groups. 

The PCA and mRMR selected key features (i.e. right caudate 
volume, the average fcMRI Z values seeded from bilateral 
caudate, bilateral IFG opercularis and triangularis parts, the 
DMN average Z, and the VMHC average Z) are consistent with 
previous studies that found changes associated with ASD in each 
of the six imaging features. Our results based on machine learning 
algorithms also suggest that caudate volume and connectivity 
imaging features are important components in the composite score 
for ASD diagnosis. 

Cross-validation minimizes classification error estimation bias 
and is thus primarily used for small sample datasets. As expected, 
the classification accuracy dropped from ~100% for the full 
dataset to ~lQ"/a with lO-fold or percentage-split cross-validation. 
The decreased accuracy in the cross-validation evaluation also 
reflected the heterogeneity of our data sample and a potential 
over-fitting problem. Nevertheless, the remarkable improvement 
in outcome prediction from multiple combined imaging features 
compared to any single imaging features (r>0.94, P<0.0001) in all 
three phenotypic domains supports the notion that these key 
imaging features are reflective of the behavioral deficits in autism. 

Limitations and future works 

Although we used substantially large numbers of classifiers and 
reported the optimal one, our results were limited by the selection 
criteria. Incorporating other factors such as different feature and 
subset selection criteria besides PCA and mRMR may improve the 
classifier performance in the future. The current quantitative 
fcMRI features include local and network-based features. Further 
expanding the fcMRI feature space, such as using whole-brain 
ROI-based fcMRI and fALFF of with dual regression template 
model [39], may reduce ROI-selection bias. Applicatic)n of 
recentiy developed deep learning algorithms [55] to prevent 
feature overfitting problems could provide more comprehensive 
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results. In order to minimize motion-related errors in RS-fMRI 
data, further investigation of motion parameters, such as frame 
displacement with dynamic temporal screening criteria, could be 
implemented in the future [56]. 

The regional parcellation with Freesurfer used in our study has 
been shown to be a robust method. However, the results of direct 
volumetry and cortical thickness could be affected by the 
normalization errors of projecting individual brains to a template 
brain, given the younger age range of our subjects compared to the 
averaged template. Implementing spatial normalization using age- 
specific templates in Freesurfer is likely important. An additional 
limitation is that the clinical phenotypic scales were obtained in the 
first years of life of participants, and not necessarily the period 
during which they underwent MRI scanning. This may potentially 
increase the possibility of mismatch between symptoms and brain 
features. 

Conclusion 

This study applied advanced automatic classification and 
machine learning algorithms to multiparametric structural and 
functional connectivity data to improve characterization and 
prediction in autism spectrum disorders. A major finding of this 
study is that the small-world network anah'sis bastxi on volumetry 
and fcMRI did not find differences between ASD and TD 
children, while the small-world network analysis analysis based on 
cortical thickness revealed reduced total and global efficiency in 
ASD. Moreover, we also found caudate volume, caudate-cortical 
fcMRI and IFG pars opercularis as weU as triangularis fcMRI to 
be highly predictive of phenotypic features in ASD. Graph theory 
and machine learning analysis of multiparametric MRI may prove 
usefiil in identifying using imaging biomarkers for disease 
prognosis and monitoring disease progression in ASD. 
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